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RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTTN 2.1 e-mail help: patin21helD@uspto.gov or phone 703-306-41 l||&^ax) 
PATENTTN 3.0 e-mail help: patin3help@uspto.eov or phone 703-30(W119J^' Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user -interface to check whether a sequence listing is in. 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPOyS^dard ST.25. . . . J 

; Checker Version 3.0 replaces the pr^ is Y2K 

compliant. Checker aUowsj^^ 



RaW'Scqucncc Listing Error SfWTmary 



fPff I 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

Wrapped Nucleics The numberrtext at the end of each line "wrapped" down to the next line. 

This may occur if your file was retrieved in a word processor after creating K. 
Please adjust your right margin to .3. as this will prevent "wrapping". 

Wrapped Aminos The amino acid numberAext at the end of each line "wrapped " down to the next line. 

This may occur if your file was retrieved in a word processor after creating rt. 
Please adjust your right margin to .3. as this will prevent "wrapping". 



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 



Misaligned Amino Acid 
Numbering 

Non-ASCII 



Variable Length 



The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequence(s) contain n's or Xaa's which represented more than one residue 

As per the rules, each r, or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing 



Patentln ver. 2.0 "bug" 



A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

oequence(s) . Normally, Patentln would automatically generate this section from the 

previousry coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 



to V 



11 



12 



13 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



Use of n's or Xaa's 
(NEW RULES) 



Use of "Artificial" 
(NEW RULES) 

Use of <220> Feature 
(NEW RULES) 



Patentln ver. 2.0 "bug" 



Sequence(s) missing, if intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

Ji) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTJON:SEQ ID NO:X: 
This sequence is intentionally skipped 

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Use of "Artificiar only as "<213> Organism" response is incomplete, per 1.823(b) of New Sequence Rules. 
Valid response is Artificial Sequence. 

Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANOATORY if <213>ORGANISM is "Artificial Sequence" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager or any other means to copy file to floppy disk. 
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OIPE 



^ ' ^/^V **** SEQUENCE LISTING DATE: 04/18/2001 

4fr A PATENT APPLICATION: US/09/820,843 TIME: 07 : 3 5 ^ ^ 

Q»] Input Set : A:\ES.txt Corrected Diskette Needed 



Output Set: N:\CRF3\04182001\I820843.ravT 



Q0> APPLICANT: Council of Scientific and Industrial Research 
^<120> TITLE OF INVENTION: A COMPUTATIONAL METHOD FOR THE IDENTIFICATION OF CANDIDATE PROTEINS 
6 USEFUL AS ANTI - INFECTIVES 

8 <130> FILE REFERENCE: Q63915 
10 <140> CURRENT APPLICATION NUMBER: US/09/8 20 f 84 3 
10 <141> CURRENT FILING DATE: 2001-03-30 
10 <160> NUMBER OF SEQ ID NOS : 118 
12 <170> SOFTWARE: Patentln version 3.0 

14 <210> SEQ ID NO: 1 J- \ A l PV\ 

17 <213> ORGANISM: C. jejuni / 

19 <220> FEATURE: ^ — V. ^ 

20 <221> NAME/KEY ^f iisc- feature ^ VYW$<- 

21 <223> OTHER INFORMATION: hiqhiy acidic protein 
24 <220> FEATURE: ^- . 

;--> 25 <221> NAME/KEY : < jiisc - f eature^- > 

26 <223> OTHER INFORMATION: gi | 6967728 
29 <400> SEQUENCE: 1 

31 Met Ala Tyr Glu Asp Glu Glu Asp Leu Asn Tyr Asp Asp Tyr Glu Asn 

32 1 5 10 15 

34 Glu Asp Glu Glu Tyr Pro Gin Asn His His Lys Asn Tyr Asn Tyr Asp 

35 20 25 30 

37 Asp Asp Asp Tyr Glu Tyr Asp Asp Asp Asn Asn Asp Asp Asp Phe Tyr 

38 35 40 45 

40 Glu Met Asp 

41 50 

43 <210> SEQ ID NO: 2 

44 <211> LENGTH: 32 

45 <212> TYPE: PRT 

46 <213> ORGANISM: C. jejuni 
48 <220> FEATURE: 
4 9 <221> NAME/KEY: misc_feature 

50 <223> OTHER INFORMATION: small hydrophobic protein 

53 <220> FEATURE: 

54 <221> NAME/KEY: misc_f eature 

55 <223> OTHER INFORMATION: gi | 6969129 
58 <400> SEQUENCE: 2 

60 Met Thr Met Leu Asp lie Phe Glu lie He Phe He Thr Thr Val Val 

61 1 5 10 15 

63 He He Gly Phe Gly Gly He Val Phe Val Val Thr Lys Glu Lys Lys 

64 20 25 30 

66 <210> SEQ ID NO: 3 

67 <211> LENGTH: 57 

68 <212> TYPE: PRT 

69 <213> ORGANISM: C. jejuni 
71 <220> FEATURE: 



11e://C:\CRF3\Outhold\VsrI820843.htm 



4/18/01 
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RAW SEQUENCE LISTING DATE: 04/18/2001 

PATENT APPLICATION: US/09/820,84 3 TIME: 07:35:07 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\04182001\l820843.raw 

72 <221> NAME/KEY: misc_f eature 

73 <223> OTHER INFORMATION: putative coiled coil protein 

76 <220> FEATURE: 

77 <221> NAME/KEY: misc_f eature 

78 <223> OTHER INFORMATION: gi 16968493 
81 <400> SEQUENCE: 3 

83 Met Ser Phe Glu Glu Asn. Leu Lys His Ala Asn Glu Ser Leu Glu Lys 

84 1 5 10 15 

86 Leu Asn Asn Gin Glu Leu Ala Leu Asp Glu Ser Val Lys lie Tyr Lys* 

87 20 25 30 

89 Glu Gly Leu Glu Ser lie Lys Lys Ala Arg Leu Glu Leu Glu Lys Ala 

90 35 4C 45 

92 Lys Leu Glu Val Glu Gin lie £sp Glu 

93 50 55 

95 <210> SEQ ID NO: 4 

96 <211> LENGTH: 542 

97 <212> TYPE: PRT 

98 <213> ORGANISM: C. jejuni 

100 <220> FEATURE: 

101 <221> NAME/KEY: mi sc_f eature 

102 <223> OTHER INFORMATION : highly acidic protein 

105 <220> FEATURE: 

106 <221> NAME/KEY: misc„f eature 

107 <223> OTHER INFORMATION: gi 16968611 



110 


<4 00> SEQUENCE: 


4 
























112 


Met 


Lys 


lie 


Leu 


Leu 


Leu 


Asn 


Glu 


Asn 


Pro 


Val 


Val 


Ser 


Arg 


Leu 


Val 


113 


1 






5 










10 










15 




115 


Ser 


Leu 


Ser 


Ala 


Lys 


Lys 


Met 


Ser 


Tyr 


Asp 


Phe 


Glu 


Glu 


Leu 


Asn 


Ala 


116 








20 










25 










30 






118 


Tyr 


Ser 


Glu 


Asn 


Leu 


Gly 


Asn 


Tyr 


Asp 


Val 


He 


Val 


Val 


Asp 


Ser 


Asp 


119 






35 










40 










45 








121 


Thr 


Pro 


Ala 


Pro 


Leu 


Lys 


He 


Leu 


Lys 


Glu 


Lys 


Cys 


Asp 


Arg 


Leu 


He 


122 




50 










55 










60 










124 


Phe 


Leu 


Ala 


Pro 


Arc 


Asn 


Gin 


Asn 


Val 


Glu 


Asp 


He 


Asp 


Ala 


Gin 


He 


125 


65 










70 










75 










80 


127 


Leu 


Gin 


Lys 


Pro 


Phe 


Leu 


Pro 


Thr 


Asp 


Phe 


Leu 


Asn 


Leu 


Leu 


Asn 


Asn 


128 










85 










90 










95 




130 


Lys 


Asp 


Ala 


Asn 


Lys 


His 


Thr 


Ser 


lie 


Asp 


Leu 


Pro 


Met 


Leu 


Ser 


Asn 


131 






100 










105 










110 






133 


Asp 


Glu 


Asn 


Pro 


Tyr 


A3 a 


Asp- 


He 


Ser 


Leu 


Asp 


Leu 


Asp 


Asn 


Leu 


Asn 


134 






115 










120 










125 








136 


Leu 


Asp 


Asp 


Leu 


Pro 


Asp 


Glu 


Asn 


Ser 


Leu 


Asp 


He 


Asn 


Ser 


Glu 


Gly 


137 




130 










135 










140 










139 


Met 


Glu 


Asp 


Leu 


Ser 


Phe 


Asp 


Asp 


Asp 


Ala 


Gin 


Asp 


Asp 


Asn 


Ala 


Asn 


140 


J45 










150 










155 










160 


142 


Lys 


Thr 


Leu 


Glu 


Thr 


Gin 


Asn 


Leu 


Glu 


His 


Glu 


Thr 


He 


Lys 


Glu 


Gin 


143 








165 










170 










175 




145 


Thr 


Gin 


Glu 


Asp 


Thr 


Gin 


He 


Asp 


Leu 


Asp 


Leu 


Thr 


Leu 


Glu 


Asp 


Gly 


146 








180 










185 










190 
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RAW SEQUENCE LISTING DATE: 04/18/2001 

PATENT APPLICATION: US/09/820,843 TIME: 07:35:07 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\0418200L\I820843.raw 



1 A Q 
14 O 


pi,, 
GIU 


Cot* 


Glu 


Ly s 


Glu 


Asp 


Leu 


Ser 


Gin 


Glu 


His 


Thr 


Ala 


Leu 


Asp 


Thr 


14 y 






19 5 










200 










205 






Glu 


151 


Glu 


Pro 


Cor 


Leu 


As p 


Glu 


Leu 


Asp 


Asp 


Lys 


Asn 


Asp 


Glu 


Asp 


Leu 


i en 
Ijz 




Z X u 










215 










220 










104 


lie 


Ly s 


Glu 


As p 


Asp 


Lys 


Asn 


Glu 


Glu 


lie 


Glu 


Lys 


Gin 


Glu 


Leu 


Leu 


■ICC 

ID J 


Z ZD 








2 30 










235 










240 


157 


Asp 


Asp 


otrx 


Lys 


Thr 


Asn 


Thr 


Leu 


Glu 


Met 


Gin 


Glu 


Glu 


Leu 


Ser 


Glu 


ICQ 

1 bo 








24 5 










250 










255 




i a n 
lbu 


Car 


n n 

Lj X 11 


AS 
sp 


Asp 


Asn 


Ser 


Asn 


Lys 


Thr 


Leu 


Glu 


Thr 


Gin 


Asn 


Leu 


Glu 


lo 1 


















265 










270 






lb 3 


HIS 


Asp 


Asn 


Leu 


Glu 


Gin 


Giu 


Thr 


He 


Lys 


Glu 


Gin 


Thr 


Gin 


Glu 


Asp 


1 C A 

164 




Z l D 










280 










285 








166 


Thr 


Gin 


i ie 


Asp 


Leu 


Asp 


Leu 


Thr 


Leu 


Glu 


Asp Giy Glu 


Ser 


Glu 


Lys 


lb / 














295 










300 










169 


Glu 


Asp 


Leu 


oer 


o X 11 




His 


Th^ 


Ala 


Leu 


Asp 


Thr 


Glu 


Pro 


Ser 


Leu 


170 










•1*1 0 
o x u 










315 










320 


172 


Asp 


Glu 


Leu 


Asp 


Asp 


Lys 


Asn 


Asp 


Glu 


Asp 


Leu 


Glu 


Asp 


Asn 


Lys 


Glu 


17 3 








J 










330 










335 


Glu 


l/D 


Leu 


bin 


Mid 


As n 


lie 


Ser 


Asp 


Phe 


Asp 


Asp 


Leu 


Pro 


Glu 


Val 


Glu 


17b 








"3 a n 










345 










350 






178 


Gin 


Glu 


Lys 


VjIU 


Mc L. 


Asp 


c I ltr 


Asp 


Asp 


Leu 


Pro 


Glu 


Asp 


Ala 


Glu 


Phe 


17 9 






T ^ ^ 










360 










365 






Glu 


181 


Leu 


Giy 


Gin 


Ala 


Xjy o 


i > r 


Ren 


Glu 


Glu 


Ser 


Glu 


Glu 


Asn 


Leu 


Glu 


182 




*3 *7 A 




















380 








Phe 


184 


Phe 


Ala 


Pro 


vai 


Val 


GIU 


Pin 
GIU 


Asp 


Tip 

X ic 


Gin 


Asp 


Glu 


He 


Asp 


Asp 


185 


385 










Qfl 

J J V 










395 










400 


187 


Ala 


Ser 


Asn 


Leu 


Cor 


l nr 


olQ 


Asp 


uin 


He 


Lys 


Glu 


Glu 


Leu 


Ala 


Gin 


188 










4 U 0 










410 










415 




190 


Leu 


Asp 


GIU 


Leu 


Asp 


Tyr 


Giy 


Tip 
lit 


Asp 


Ser 


Asp 


Asn 


Ser 


Ser 


Lys 


Val 


191 






A on 

H Z\J 










425 










430 




Giy 


1 " j 


Leu 


UJ.U 


Asp 


Phe 


Lys 


Asp 


Glu 


Pro 


lie 


Leu 


Asp 


Asp 


Lys 


Glu 


Leu 


194 






435 










440 










445 






Phe 


196 


Thr 


Asn 


olU 


Glu 


Glu 


Val 


Val 


Val 


Pro 


Asn 


Leu 


Asn 


He 


Ser 


Asp 


197 




450 










455 










460 










199 


Asp 


Thr 


Leu 


Lys 


Glu 


Ser 


Asp 


He 


Gin 


Glu 


Ala 


Leu Giy 


Glu 


Glu 


lie 


200 


465 










470 










475 










480 


202 


Leu 


Glu 


Lys 


Asn 


Glu 


Glu 


Pre 


He 


Val 


Ser 


Asp 


Val 


Thr 


Lys 


Asp 


Asp 


203 








485 










490 










495 




205 


Asn 


Ser 


Glu 


Glu 


He 


Val 


Asn 


Glu 


Leu 


Ser 


Gin 


Ser 


He 


Ala 


Giy 


Ala 


206 








500 










505 










510 






208 


lie 


Thr 


Ser 


Ser 


lie 


Lys 


Asp 


Asp 


Thr 


Leu 


Lys 


Ala 


Ala 


Leu 


Lys 


Giy 


209 






515 










520 










525 








211 


Met 


Asn 


Met 


Asn 


He 


Asn 


He 


Asn 


He 


Ser 


Phe 


Lys 


Glu 


Asp 







212 530 535 

214 <210> SEQ ID NO: 5 

215 <211> LENGTH: 172 

216 <212> TYPE: PRT 

217 <213> ORGANISM: C. pneumoniaeCWL029 
219 <220> FEATURE: 



4/18/01 



k 
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RAW SEQUENCE LISTING DATE : 04/18/2001 

PATENT APPLICATION: US/09/820,843 TIME: 07:35:07 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\04182001\I820843.raw 

220 <221> NAME/KEY: misc_f eature 

221 <223> OTHER INFORMATION: histone like protein 2 

224 <220> FEATURE: 

225 <221> NAME/KEY: misc_f eature 

226 <223> OTHER INFORMATION: gi 14376663 



231 


Met 


lie 


Gly 


Ala 


bin 


Lys 


Lys 


uin 


Cor 






Lys 


Thr 


Ala 


Ser 


Arg 


232 


1 








c 










10 










15 




234 


Ala 


val 


Arg 


Lys 


Pro 


t\Lo. 


Lys 


Lys 


Val 


Ala 


Ala 


Lys 


Arq 


Thr 


Val 


Lys 


235 








on 










25 










30 






237 


Lys 


Ala 


Thr 


Val 


Arg 


Lys 


Thr 


Ala 


Val 


Lys 


Lys 


Pro 


Ala 


Val 


Arg 


Lys 


238 




35 










40 










45 






Thr 


240 


Thr 


Ala 


Ala 


Lys 


Lys 


Thr 


Val 


Ala 


Lys 


Lys 


Thr 


Thr 


Ala 


Lys 


Arg 


241 




50 










55 










60 








Ala 


243 


Val 


Arg 


Lys 


Thr 


Val 


Ala 


Lys 


Lys 


Pro 


Ala 


Val 


Lys 


Lys 


Val 


Ala 


244 


65 










70 










75 










80 


246 


Lys 


Arg 


Val 


Val 


Lys 


Lys 


Thr 


Val 


Ala 


Lys 


Lys 


Thr 


Thr 


Ala 


Lys 


Arg 


247 
















90 










95 


Thr 


249 


Ala 


Val 


Arg 


Lyp 


Thr 


Val 


Ala 


Lys 


Lys 


Pro 


Val 


Ala 


Arg 


Lys 


Thr 


250 








100 










105 










110 






252 


Val 


Ala 


Lys 


Gly 


Ser 


Pro 


Lys 


Lys 


Ala 


Ala 


Ala 


Cys 


Ala 


Leu 


Ala 


Cys 


253 






115 










120 










125 








255 


His 


Lys 


Asn 


His 


Lys 


His 


Thr 


Ser 


Ser 


Cys 


Lys 


Arg 


Val 


Cys 


Ser 


Ser 


256 




130 










135 










140 








His 


258 


Thr 


Ala 


Thr 


Arg 


Lys 


His 


Gly 


Ser 


Lys 


Ser 


Arg 


Val 


Arg 


Thr 


Ala 


259 


145 










150 










155 










160 


261 


Gly 


Trp 


Arg 


His 


Gin 


Leu 


He 


Lys 


Met 


Met 


Ser 


Arg 










262 










165 










170 















264 <210> SEQ ID NO: 6 

265 <211> LENGTH: 63 

266 <212> TYPE: PRT 

267 <213> ORGANISM: C. trachomatis 

269 <220> FEATURE: 

270 <221> NAME/KEY: misc_f eature 

271 <223> OTHER INFORMATION: hypothetical protein-possible frameshift with CT593 

274 <220> FEATURE: 

275 <221> NAME/KEY: misc_f eature 

276 <223> OTHER INFORMATION: gi I 3522902 
279 <400> SEQUENCE: 6 

281 Met Phe Thr Leu ?he Leu Cys Glu His Leu Leu Thr Asn He Leu Ala 

282 1 5 10 15 

284 Ser Ser Phe Leu Ala Lys Ser Gin Gly Phe He Thr Leu Val Asn Leu 

285 20 25 30 

287 Phe His Lys He Pro Gly Leu Lys Val He Glu He Thr Cys Leu Ala 

288 35 40 45 

290 Leu Pro Leu Gly He His Ser He He Gly Phe Ser Tyr Leu Leu 

291 50 55 60 

293 <210> SEQ ID NO: 7 

294 <211> LENGTH: 203 



t n /-> /"\ o a / "» 1. 



RAW SEQUENCE LISTING ' ' DATE: 04/18/2001 

PATENT APPLICATION: US/09/820,843 TIME: 07:35:07 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\04182001\I820843.raw 



295 


<212> TYPE: PRT 


























296 


<213> ORGANISM: 


C. trachomatis 


















298 


<220> FEATURE: 


























299 


<221> NAME/KEY: 


misc_f eature 


















300 


<223> OTHER INFORMATION: 


: histone like protein 2 










303 


<220> FEATURE: 


























304 


<221> NAME/KEY: 


mi sc_f eature 


















305 


<223> OTHER INFORMATION: 


rri 1 

gi | 


3328438 
















308 


<4UU> bii(JU EjWUii* . 


7 
























310 


Met Asn Met Leu 


uxy 


Val 


tj in 


Lys 


Lys 


vjf i? 


Ser 


Thr 


Arg 


Lys 


Thr 


Ala- 


311 


1 


tr 
D 










1 0 
1 u 










15 




313 


Ala Arg Lys Thr 


vai 


Val 


Arg 


Lys 


Pro 


Ala 
nla 


Ala 
nla 




L<y s 


X 111 


A 1 a 


Ala 


314 


Z U 


























316 


hyS LyS Ala. fiO 


va i 


Arg 


Lys 


Va 1 
Val 


Al a 
nx a. 


Al a 
nla 




Lys 


Thr 


Va 1 
vai 


Ala 


Arg 


317 


■J D 








40 










45 








319 


LyS 1 Mr Val Ala 


Lys 


Lys 


i ill. 


Val 


Al a 


Ala 
nla 


nxg 


Lys 


tr 1 \J 


Val 


Ala 


Lys 


320 








D D 










60 










3 22 


Lys Ala Thr Ala 


Lys 


Lys 


Ala 


IT 1 U 


Val 


Arg 


Lys 


Val 


h 1 a 
nl a 


Hlu 


Ivy o 


Lys 


323 


65 














75 










80 


325 


Thr Val Ala Arg 


Lys 


Thr 


Vol 


Ala 


Lys 


Lys 


Thr 


Val 


A 1 A 

nla 


A 1 a 
n La 


Aru 


Lys 


326 




85 










90 










9 5 




328 


Pro Val Ala Lys 


IjVS 


Ala 


Thr 


Ala 


Lys 


Lys 


Ala 


Pro 


Val 


Arg 




Ala 


329 


100 










105 










110 






331 


V«l Ala Lys Lys 


Thr 


Val 


Ala 


Arg 


Lys 


Thr 

1 Il.L 


Va 1 
val 


Ala 


Ly<= 


Lys 


Thr 


Val 


332 


115 








120 










125 








334 


Ala Ala Arg Lys 


Pro 


Val 


Ala 


Lys 


Arg 


Val 


Ala 


Ser 


Thr 


Lys 


Lys 


Ser 


335 


130 






135 










140 










337 


Ser lie Ala Val 


Lys 


Ala 


Gly 


Val 


Cys 


Met 


Lys 


Lys 


His 


Lys 


His 


Thi 


338 


145 




150 










155 










160 


340 


Ala Ala Cys Gly 


Arg 


Val 


Ala 


Ala 


Ser 


Gly 


Val 


Lys 


Va 1 
vai 


Cys 


Ala 


Ser 


341 




165 










170 










175 




34 3 


Ala Ala Lys Arg 


Lys 


Thr 


Asn 


Pro 


Asn 


Arg 


Ser 


Arg 


Thr 


Ala 


His 


Ser 


344 


180 










185 










190 






346 


Trp Arg Gin Gin 


Leu 


Met 


Lys 


Leu 


Val 


Ala 


Arg 












347 


195 








200 


















349 


<210> ~EQ ID NO: 


: 8 
























350 


<211> LENGTH: 372 
























351 


<212> TYPE: PRT 


























352 


<213> ORGANISM: 


H. 


influenzae 


















354 


<220> FEATURE: 


























355 


<221> NAME/KEY: 


misc_f eature 


















356 


<22J> OTHER INFORMATION: 


: outer membrane 


integrity protein (colA 


359 


<220> FEATURE: 


























360 


<221> NAME/KEY: 


misc_f eature 


















361 


<223> OTHER INFORMATION: 


; gil 


1573353 
















364 


<400> SEQUENCE: 


8 
























366 


Met Gin Asn Asn 


Arg 


Gin 


Lys 


Lys 


Gly 


He 


Asn 


Ala 


Phe 


Ala 


He 


Ser 


367 


1 


5 










10 










15 




369 


lie Leu Leu His 


Phe 


He 


Leu 


Phe 


Gly 


Leu 


Leu 


He 


Leu 


Ser 


Ser 


Leu 
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<210> 32 

<211>. 497 

<212> PRT 

<213> B. burgdorferi 
<220> 

<221> misc_feature 

<223> predicted coding region BB0553 



<220> 

<221> misc_feature 
<223> gi|2688482 



<400> 32 

Met Asn Lys Thr Lys Asn Arg Ser Leu Thr Tyr Phe He He Leu Ser 
15 10 15 

Cys He Ser Leu Phe Gly Ala Asn Asn Asn Thr He Ser Tyr Ser Ser 

20 25 30 Jj j /) 



He Glu He Pro Leu Glu Asp Leu Ser Glu Glu Phe Lys Ser Ser Gly 
35 40 45 

Asn Lys Ser Asp Gin He Asn Thr Ser Lys His Leu Asn Lys Asn He 

50 55 60 f(l 

Val Ser Tyr Glu Asp Pro Lys Lys Gly Lys Asp Leu Lys Leu Pro Glu 
65 70 75 80 

Asn He Arg Asp Lys Lys Leu Pro Gin Lys Arg Met Asp Glu Asn Asp 
85 90 95 

Leu Lys Ser Val He Glu Asn Tyr Glu Asn Lys He Lys Asn He Glu 
100 105 110 

Lys Leu Leu Lys Thr Lys Asn Gin Lys Thr Ser Glu Asn Glu Asn Lys 
115 120 125 

Lys He Glu Ser He Glu Lys Lys Ala Lys Lys Tyr Glu He Leu Thr 
130 135 140 

Asn Lys Leu Lys Asn Glu He Val Glu lie Lys Lys Leu Leu Asn Lys 
145 150 155 160 

Lys He Lys Pro Lys Glu Asp Glu Asn Tyr Glu Lys He Asn He Glu 
165 170 175 



j2/Ia^W 



Asn He Glu Glu Glu Thr Asp Asp Asp Phe Glu Asp Asn Tyr Glu Tyr 
180 185 190 



/O 



Asn Asp Glu He Glu/Xaa jThr Asn Glu Asp Asn Tyr Pro Ser Asn Glu 
. 195 200 205 (fy\ 

r yM Gly He He Asn Asn Leu Lys Glu Asn Leu Asn Glu Asn Glu Lys Tyr JL*syt~^>r^^ 

\ Please Note: kl W 

Use f n and/ r Xaa have been detected in the Sequenc Listing. Please review the yJ / ^ J ^f 
Sequenc Listing to nsure that a corresponding explanation is presented in th <220> to 
<223> fields f each sequence which presents at least one n or Xaa. 




VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/820,843 



' DATE: 04/18/2001 
TIME: 07:35:08 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\04182001\I820843.raw 



L:10 M:270 C: Current Application Number differs, Replaced Current Application No 
L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:20 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID# : 1 
L:25 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID# : 1 
L:2886 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:32 
L:2886 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:32 

L:4688 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:66 
L:4688 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:66 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 




re application of 
Samir Kumar BRAHMACHARI et al. 

Appln. No. : 09/820,843 Group Art Unit: Not Yet Assigned 

Filed: March 30, 2001 Examiner: Not Yet Assigned 

Confirmation number: 7045 

For: A COMPUTATIONAL METHOD FOR THE IDENTIFICATION OF CANDIDATE 
PROTEINS USEFUL AS ANTI-INFECTIVES 

STATEMENT TO SUPPORT FILING AND SUBMISSION IN 
n ACCORDANCE WITH 37 C.F.R. §§ 1.821-1.825 

Assistant Commissioner for Patents 
Washington, D.C. 20231 
Box SEQUENCE 



til 



in 



Sir: 

In connection with a Sequence Listing submitted concurrently herewith, the undersigned 
hereby states that: 

1. the submission, filed herewith in accordance with 37 C.F.R. § 1.821(g), does not 
include any new matter; 

2. the content of the attached paper copy and the attached computer readable copy of 
the Sequence Listing, submitted in accordance with 37 C.F.R. § 1.821(c) and (e), respectively, 
are the same; and 

3. all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true, and further, that these statements were 
made with the knowledge that willful false statements and the like so made are punishable by 



STATEMENT TO SUPPORT FILING AND SUBMISSION 
IN ACCORDANCE WITH 37 C.F.R. §§ 1.821-1.825 



Q63915 



fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent resulting 
therefrom. 



Respectfully submitted, 




SUGHRUE, MION, ZINN, Drew Hissong 

MACPEAK & SEAS, PLLC Registration No. 44,765 

2100 Pennsylvania Avenue, N.W. 
Washington, D.C. 20037-3213 
Telephone: (202) 293-7060 
Facsimile: (202) 293-7860 



Date: May 18, 2001 



